Towards An Automatic Identipeation Of Topic And Focus

نویسندگان

  • Eva Hajicová
  • Petr Sgall
چکیده

The purpose of the paper is (i) to substantiate the claim that the output of an automatic analysis should represent among other things also the hierarchy of toplc-focus articulation, and (ii) to present a general procedure for determining the toplc-focus articulation in Czech and English. (i) The following requirements on the output of an automatic analysis are significant: (a) in the output of the analysis it should be marked which elements of the analyzed sentence belong to its topic and which to the focus$ (b) the scale of communicative dynamism (CD) should also be identified for every representation of a meaning of then -ly-zed sentence, since the degrees of CD correspond to the unmarked distribution of quantifier scopes in the semantic interpretation of the sentences (c) the analysis should also distinguish toplcless sentences from those hav~ng a topic, which is relevant for the scope of negation. (ii) For an automatic recognition of topic, focus and the degrees of CD, two ~ oints are crucial: a) either the input language has (a considerable degree of) the so-called free word order (as in Czech, Russian), or its word order is determined mainly by the grammatical relations (as in English, Prench); (b) either the input is spoken discourse (and the recognition procedure includes an acoustic analysis), or written (printed) texts are analyzed. In accordance with these points, a general procedure for determining topic, focus and the degrees of CD is formulated for Czech and English, with some hints how the preceding context can be taken into account. le We distinguish between the l@vel of l~uistic,meaning (de Saussure s and HJe~mslev s "form of content", Cosieru s "Bedeutung", others "literal meaning") and its interpretation in the sense of truth-conditional, intensional logic (see Materna and Sgall, 1980~ Sgall, 1983). Por some purposes of automatic treatment of natural l~n~uage (including machine translation) it is sufficient if the output of the procedure of analysis is more or less identical with the representation of the (linguistic) meaning of the sentence. For other purposes, such as that of full natural language comprehension, it is necessary to go as far as the semantic (truth-conditional) i~terpretation, using a notation that includes variables, operators, parentheses and similar means. The topic-focus articulation (TFA) is understood as one of the hierarchies of the level of meaning, whose other two hierarchies are that of dependency syntax (close to case grammar) and that of coordination (and …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

Automatic QoS-aware Web Services Composition based on Set-Cover Problem

By definition, web-services composition works on developing merely optimum coordination among a number of available web-services to provide a new composed web-service intended to satisfy some users requirements for which a single web service is not (good) enough. In this article, the formulation of the automatic web-services composition is proposed as several set-cover problems and an approxima...

متن کامل

A review of text mining approaches and their function in discovering and extracting a topic

Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling.  Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...

متن کامل

A New Approach towards Precise Planar Feature Characterization Using Image Analysis of FMI Image: Case Study of Gachsaran Oil Field Well No. 245, South West of Iran

Formation micro imager (FMI) can directly reflect changes of wall stratums and rock structures. Conventionally, FMI images mainly are analyzed with manual processing, which is extremely inefficient and incurs a heavy workload for experts. Iranian reservoirs are mainly carbonate reservoirs, in which the fractures have an important effect on permeability and petroleum production. In this paper, a...

متن کامل

مقایسۀ کاربرد انواع روش‎های ارزیابی دسترس‎پذیری وب‎سایت‎ها مطالعۀ موردی: وب‎سایت وزارتخانه‌های دولت جمهوری اسلامی ایران)

Purpose: The present research aims to comparatively study different methods for evaluating the accessibility of websites and analyze the results of case study concerning websites of ministries of Iranian government, in order to indicate the strengths, weaknesses, and differences in evaluation findings by applying each of website accessibility methods. Methodology: In this paper, initially the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1985